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SYSTEM AND METHOD OF SUPPORTING TRANSPORT AND 
PLAYBACK OF SIGNALS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. provisional application 
5 Serial No. 60/540,161 filed January 29, 2004 and U.S. provisional application serial 
No. 60/598,241 filed August 3, 2004, the disclosures of which are hereby 
incorporated in its entirety. 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

10 

The present invention relates to methods and systems of supporting 
transport and playback of signals. 

2. Background Art 

Cable system operators and other operators providing video and audio 
15 (AV) signals to customers typically employ baseline architectures to transport the 
signals from a headend unit or similar feature to the customer equipment (CE), 
whether such signals are related to broadcast television, video on demand (VOD), 
or other services. The baseline architecture multiplexes the AV signals into an AV 
only transport that is modulated and transmitted over cable or similar transmission 
20 means to the CE. One common example relates to cable system that packet ize AV 
signals for transport in MPEG-2 transport streams. 

The ability to support bi-directional data communication with the CE 
and a network, such as the internet, is becoming more advantageous as CE becomes 
more advanced and capable of manipulating data signals. In the past, architectures 
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supporting bi-directional data communication included a data communication unit 
separate from the headend unit to multiplex data signals into a data only transport. 
One common example relates to cable modem termination systems (CMTSs) which 
are typically located remotely from headends and are configured to packetize and 
5 transport data signals in data over cable services interface specification (DOCSIS) 
transports. 

The AV transport and the data transport are separately delivered to 
the CE over cable or other communication mediums. 

SUMMARY OF THE INVENTION 

10 One non-limiting aspect of the present invention relates to a reference 

model configured for transporting video, audio, and data payloads over a common 
transport stream, such as transport stream associated with a data over cable services 
interface specification (DOCSIS). 

In accordance with one non-limiting aspect of the present invention, 
15 the reference model may include an application layer defining creation of the video, 
audio, and data signals, a link layer defining multiplexation of the video, audio, and 
data signals into the common transport, and a physical layer defining transportation 
of the common transport over the cable system. 

The link layer may define multiplexing of the signals based on data 
20 over cable services interface specifications (DOCSIS), such as by specifying data 
packets with packet identifier (PID) OxlFFE and without an associated adaptation 
field whereas the audio and video packets are identified with PIDs other than those 
having the OxlFFE designation and are associated with an adaptation field for 
decoder synchronization. 

25 The reference model may further include a transport layer defining 

management of the video, audio, and data signals and a network layer defining 
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transmission of the video, audio, and data signals between networks. The transport 
layer may define management based on real-time protocols (RTP), user datagram 
protocols (UDP), transmission control protocols (TCP), and/or MPEG-2 protocols, 
and the network layer may define transmission based on internet protocols (IP). 

5 In accordance with one non-limiting aspect of the present invention, 

the reference model may support transport and playback of AV packets according 
to a number of different protocols, including a baseline architecture where MPEG 
AV streams are carried directly over MPEG-2 transport and data packets are carried 
separately over a DOCSIS MPEG-2 transport such that different transport streams 
10 are associated with data and A/V packets, an extended mode 1 where MPEG-2 AV 

transport packets are combined with DOCSIS data packets in a single DOCSIS 

> 

MPEG-2 transport stream, and an extended mode 2 where MPEG-2 AV transport 
packets in RTP payloads over UDP over IP over DOCSIS are combined with 
DOCSIS data packets in a single DOCSIS MPEG-2 transport stream with the ability 
15 to also use other real-time protocols instead of RTP. 

One non-limiting aspect of the present invention relates to a 
subscriber video device (SVD) configured for receiving video, audio, and data 
signals in a common transport stream and outputting the received signals for 
playback on a media output device. In accordance with one non-limiting aspect of 
20 the present invention, the SVD may receive and playback signals from architectures 
associated with baseline or extended modes 1 and 2. 

In accordance with one non-limiting aspect of the present invention, 
the SVD may be configured to support playback of AV signals packetized for 
delivery in an AV only transport associated with AV packets and an integrated 

25 transport associated with AV and data packets. In accordance with one non-limiting 
aspect of the present invention, the AV packets of the integrated transport may only 
be recovered therefrom through data processing. The SVD may include a switch 
configured to separate packets associated with AV only transport from packets 
associated with the integrated transport, a data processor in communication with the 

30 switch and configured to separate AV related packets from data related packets 
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included within the integrated transport, and a demultiplexer in communication with 
the switch and data processor configured to demultiplex AV packets outputted 
therefrom. 

In accordance with one non-limiting aspect of the present invention, 
5 the SVD may be configured to support digital television (DTV) signals packetized 
according MPEG-2 and DOCSIS protocols. The SVD may include a switch 
configured to separate packets associated with MPEG-2 protocols from packets 
associated with the DOCSIS protocols, a DOCSIS data processor in communication 
with the switch and configured to receive the packets associated with the DOCSIS 
10 protocols and to separate AV related packets from data related packets included 
therein, and a demultiplexer in communication with the switch and data processor 
and configured to demultiplex AV packets outputted therefrom. 

The SVD may include a decoder in communication with the 
demultiplexer configured to decode AV payloads for output to a video port and an 
15 audio port. The decoder may be configured for decoding any number of payloads, 
including those associated with MPEG-2 protocols and advanced video compression 
(AVC) protocols, such as MPEG-4 and H.264. 

The above features and advantages, along with other features and 
advantages of the present invention, are readily apparent from the following detailed 
20 description of the invention when taken in connection with the accompanying 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 illustrates a reference model in accordance with one 
25 non-limiting aspect of the present invention; and 

FIGURE 2 illustrates a flexible subscriber video device (SVD) in 
accordance with one non-limiting aspect of the present invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 

FIGURE 1 illustrates a reference model 10 in accordance with one 
non-limiting aspect of the present invention. The reference model 10 is configured 
for transporting audio, video, and data payloads over a common transport. In more 
5 detail, the reference model may support the combining of packetized audio and 
video (AV) signals, such as those associated with digital television, and packetized 
data signals, such as those associated with high speed Internet communications and 
the like, including web services and application data from servers, over a common 
transport stream. 

10 The reference model is advantageous for use in cable systems and 

other environments where it may be desirable to transport multimedia signals (i.e. 
audio, video, and data) signals over a common transport, such as telecommunication 
systems. Of course, the present invention contemplates its application in any 
number of environments and is not intended to limited to cable and 

15 telecommunication systems. 

The reference model 10 may include an application layer 14 that 
ultimately results in the creation of video, audio, and data signals. In more detail, 
the application layer relates to functions and operations through which a user or 
other operator interfaces to create or manipulate incoming signals. In accordance 
20 with one non-limiting aspect of the present invention, the application layer 14 
includes an AV application relating to AV signals and a data application relating to 
data signals, i.e., signals originated from non-AV sources. 

Of course, the present invention contemplates the application layer 
14 including applications relating to other types of signals, such as a streaming video 
25 and data application relating to packetizing digital video stored on a server and other 
data associated with the streamed video and video services, such as program specific 
information / system information (PSI/SI). In more detail, the source may include 
a program specific information / system information (PSI/SI) generator (not show) 
which receives instructions from providers, vendors, and/or operators at the source 
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or in communication therewith for associating data and information signals with the 
broadcast television signals. For example, the PSI generator may be configured to 
provide information regarding specific programs, such as detailed program 
descriptions, associated program authorizations, conditional access features, 
5 interactive tags, program (channel) listing, and the like. Likewise, the SI generator 
may be configured to provide less specific information, such as elementary stream 
program identifiers (such as those required to select specific programs from a 
multiple program transport stream), electronic programming guide (EPG) 
information, channel maps (in cable systems for example, each provider may be 
10 assigned a channel in a channel map which the customer may tune to with their CE 
to locate the associated television program or other service) content navigation 
information, tuning information (such as a frequency map of the system), 
presentation information, and the like. 

The reference model 10 may include a transport layer 18 for defining 
15 management of signals associated with the applications. In more detail, the 
transport layer defines protocols and procedures for exchanging and managing the 
signals, including error recovery and flow control, such as through real-time 
protocols (RTP), user datagram protocols (UDP), transmission control protocols 
(TCP), and/or MPEG-2 protocols. 

20 RTP is an Internet-standard protocol included within the transport 

layer to facilitate the transport of real-time data with robust management, including 
audio and video associated with the AV applications. It can be used for 
media-on-demand as well as interactive services such as Internet telephony. RTP 
consists of a data and a control part commonly referred to a Real-time Transport 

25 Control Protocol (RTCP). The data part of RTP is a thin protocol providing support 
for applications with real-time properties such as continuous media (e.g., audio and 
video), including timing reconstruction, loss detection, security and content 
identification. RTP has important properties of a transport protocol, including 
offering quality-of-service feedback from receivers to the multicast group as well as 

30 support for the synchronization of different media streams, source identification and 
support for gateways like audio and video bridges as well as multicast-to-unicast 
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translators, support for real-time conferencing of groups of any size within an 
internet, end system operability, demultiplexing capabilities, hooks for adding 
reliability, where appropriate, and flow/congestion control also referred to as 
application-level framing. 

5 Of course, the present invention contemplates other applications of 

RTP, such as making RTP transport-independent so that it could be used over CLNP 
(Connectionless Network Protocol), IPX (Internet Packet eXchange) or other 
protocols and directly over AAL5/ATM (ATM Adaptation Layer 5 over 
Asynchronous Transfer Mode protocol). In more detail, ISO CLNP is a datagram 

10 network protocol that provides fundamentally the same underlying service to a 
transport layer as IP. CLNP provides essentially the same maximum datagram size, 
and for those circumstances where datagrams may need to traverse a network whose 
maximum packet size is smaller than the size of the datagram, CLNP provides 
mechanisms for fragmentation (data unit identification, fragment/total length and 

15 offset). Like IP, a checksum computed on the CLNP header provides a verification 
that the information used in processing the CLNP datagram has been transmitted 
correctly, and a lifetime control mechanism ("Time to Live") imposes a limit on the 
amount of time a datagram is allowed to remain in the internet system. As is the 
case in IP, a set of options provides control functions needed or useful in some 

20 situations but unnecessary for the most common communications. 

IPX is the networking protocol used in Novell networks. It is an 
unreliable connectionless internet work protocol that passes packets transparently 
from source to destination, even if the source and destination are on different 
networks. IPX is functionally similar to IP except that it uses 12-byte addresses 

25 instead of 4-bytre addresses. ATM is the underlying technology behind the 
Broadband Integrated Services Digital Network (B-ISDN) being built by the 
telephone companies to offer video on demand, live television, electronic mail, 
music services, high-speed data services, etc. ATM transmits all information in 
small fixed-size packets called cells. The cells are 53 bytes long (4 bytes header, 

30 48 bytes payload). The ATM layer provides flow control and routing. The AAL 
is responsible for breaking up the larger application messages at the source into the 
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small, fixed size cells of the ATM layer and then reassembling them at the 
destination into their original form. 

In addition to the foregoing real-time management, the services of the 
transport layer 18 may include UDP transport protocol which is an unreliable, 
5 connectionless protocol for applications that may desire less management than 
TCP's sequencing or flow control. It is also widely used for one-shot, client-server 
type request-reply queries and applications in which prompt delivery is more 
important than accurate delivery, such as transmitting speech or video. 

In accordance with one non-limiting aspect of the present invention, 
10 signals associated with the data applications may not be processed according to RTP, 
MPEG-2 or other similar real-time protocols. This is done, at least in part, because 
the data applications typically do not require real-time delivery and are therefore not 
sensitive to system delay and jitter, thus permitting the use of standard TCP 
transport protocols. In contrast, however, the AV signals may require more robust 
15 management, such as if the AV signals are to be transported over a packet switching 
network that should not be subject to extreme network delay and jitter, and as such, 
are preferably associated with one of the above described real-time protocols. 

In more detail, TCP is a reliable connection oriented protocol that 
allows a byte stream originating in one machine to be delivered without error to any 
20 other machine in the Internet. It fragments the incoming byte stream into discrete 
messages and passes each one onto the Internet layer. At the destination, the 
receiving TCP process reassembles the received messages into the output stream. 
TCP also handles flow control to make sure a fast sender cannot swamp a slow 
receiver with more messages than it can handle. 

25 The reference model 10 may include a network layer 22 for defining 

transmission of the signals processed according to the transport layer 18 so as to 
permit the transmission of the signals between networks. In more detail, the 
network layer 22 controls signals transmission between computers, address and 
routing between networks, such through internet protocols (IP). IP is the Internet's 
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official packet format and protocol and uses IP protocols to route and deliver IP 
packets to correct destinations. 

The reference model 10 may include a link layer 26 defining 
multiplexation of the video, audio, and data signals into a common transport. The 
5 link layer 26 may include a MAC layer 28, such as a DOCSIS MAC, to allocate 
bandwidth, resolve contention, and guarantee the proper quality of service (QoS). 
In more detail, a DOCSIS MAC layer may be used to arbitrate access to the system 
resources, especially access to the shared downstream and uplink medium and 
transforms upper layer structures such as IP datagrams from the network layer into 

10 DOCSIS frames and then breaks up those frames so that they fit within the 
fixed-length MPEG-2 transport packets. In this manner, DOCSIS frames may begin 
anywhere within the MPEG packet and may span several MPEG packets wherein 
the first byte of the MPEG payload is typically a 'pointer field' that points to the 
start of the DOCSIS frame within an MPEG packet, and in some cases, stuffing 

15 bytes (OxFF) are used to fill the gap between DOCSIS MAC frames. 

In addition, the link layer 26 may include a transport layer 30 for 
multiplexing the AV packets and data packets into a common transport stream. The 
multiplexing may take place between data signals and AV signals processed 
according the transport and network layers 18, 22, or in accordance with one 

20 limiting aspect of the present invention, with AV signals which bypass the transport 
and network layers 18, 22, In more detail, for some AV signals, it may be 
unnecessary to process AV signals for packetized transport over wide area networks 
and therefore unnecessary to include the features associated with the 
above-described transport and network layers 18, 22, i.e., RTP, UDP, TCP, IP, and 

25 MAC, which are commonly used if the AV signals are transported over a packet 
switching network. Rather, the AV signals may be communicated directly to a 
multiplexer through non-packet switching means to be multiplexed with the data 
signals and/or other AV signals processed through the transport and link layers. 

In accordance with one non-limiting aspect of the present invention, 
30 the transport layer 30 may combine the signals into a MPEG-2 transport stream of 
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188 byte MPEG packets composed of a 4 byte header and 184 byte payload. In 
accordance with one non-limiting aspect of the present invention, the common 
transport stream differentiates AV packets from data packets using DOCSIS 
procedures and protocols, such as those included in a DOCSIS transmission 
5 convergence sub-layer. In more detail, this includes identifying data packets by a 
packet identifier (PID) OxlFFE in the MPEG-2 transport header and without an 
associated adaptation field whereas the AV packets are identified with PIDs other 
than those having the OxlFFE designation and are associated with an adaptation field 
for decoder synchronization. Of course, the present invention is not intended to be 
10 limited to the use of the DOCSIS downstream transmission convergence sub-layer 
and contemplates the use of other packet identifying logic for differentiating between 
AV packets and DOCSIS data packets. 

The reference model 10 may include a physical layer 34 defining 
transportation of the common transport over the cable system. In more detail, the 
15 physical layer 34 defines processes and procedures for transporting the common 
transport over hybrid fiber cables, such as through quadrature amplitude modulation 
(QAM), or through other means, such as wireless networks (terrestrial and 
extraterrestrial) 

As described above, the reference model 10 supports transporting the 
20 AV packets according to a number of different protocols, including a baseline 
architecture where MPEG AV streams are carried directly over MPEG-2 transport 
and data packets are carried separately over a DOCSIS MPEG-2 transport such that 
different transport streams are associated with data and A/V packets, an extended 
mode 1 where MPEG-2 AV transport packets are combined with DOCSIS data 
25 packets in a single DOCSIS MPEG-2 transport stream, and an extended mode 2 
where MPEG-2 AV transport packets in RTP payloads over UDP over IP over 
DOCSIS are combined with DOCSIS data packets in a single DOCSIS MPEG-2 
transport stream with the ability to also use other real-time protocols instead of RTP. 

FIGURE 2 illustrates a flexible subscriber video device (SVD) 40 in 
30 accordance with one non-limiting aspect of the present invention. The SVD 40 may 
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be integrated into a settop box (STB) or other feature for receiving and manipulating 
signals from a cable provider, such as a digital video recorder (DVR), media 
terminal adapter (MTA), outlet digital adapter (ODA), and the like. The present 
invention contemplates the use of the SVD 40 in other applications besides cable 
5 system and is not intended to be so limited. 

Figure 2 illustrates the SVD 40 configured for supporting signals 
carried in different transports. In more detail, the SVD 40 may be configured to 
support AV only transports, such as those associated with the baseline architectures, 
and integrated transports having both AV packets and data packets, such as those 
10 associated with the reference model (extended modes 1 and 2) described above. 

In accordance with one non-limiting aspect of the present invention, 
the SVD 40 may be a settop box (STB) unit configured to receive digital television 
signals, data signals, out-of-band messaging signals, and the like from a cable 
service provider 44. In particular, the SVD 40 may be configured to receive 
15 MPEG-2 AV transport streams, MPEG-2 DOCSIS transport steams having AV 
packets and data packets (DOCSIS pay loads), and other common/integrated 
transports having separately identified data and AV packets. The SVD 40 may be 
integrated in any number of devices to provide a flexible architecture capable of 
supporting multiple transport protocols. 

The SVD 40 may include a tuner and demodulator to tune to and 
demodulate signals provided thereto in order to recover a transport stream. In one 
non-limiting aspect of the present invention, tuner and demodulator units 46 and 48 
are configured to select a CATV 6 MHz RF carrier in a band between 52 an 1002 
MHz and to demodulate the quadrature amplitude modulated signals. The pair of 
units 46, 48 are provided to supported simultaneous playback and record, however, 
the present invention contemplates the use of more less of the units 46, 48. The unit 
46, 48 output a packetized transport stream to a switch 54 configured to separate AV 
packets from integrated packets (i.e., those having both AV and non-AV data 
packets such as DOCSIS pay loads). 



20 
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A data processor 58, such as a DOCSIS processor, communicates 
with the switch 54 to receive the integrated packets and processes the packets to 
separate AV related packets from data related packets. For example, the data 
processor 54 may recover AV related packets included within an integrated transport 
5 having both AV and non-AV data packets packetized according to data transmission 
protocols, such as DOCSIS, which prevent such AV packets from being recovered 
by a demultiplexer 62, such as those included therewith according the first and 
second reference models described above. 

The demultiplexer 62 communicates with the switch 54 and the data 
10 processor 58 to receive the AV packets outputted therefrom, i.e. those outputted 
directly from the switch 54 and those recovered by the data processor 58. The 
demultiplexer 62 locates the AV payloads associated therewith according to 
information included within the packets and outputs the signals to a decoder 64. 
The decoder 64 decodes the AV payloads, which may be payloads compressed 
15 according to any number of compression protocols, such as advanced video 
compression protocols, like MPEG-4 and H.264, and legacy compression 
protocols, like MPEG-2. 

The decoder 64 outputs the decoded AV payloads to a video port 68 
and an audio port 70. A media output device 74, such as a television, digital video 

20 recorder (DVR), or other feature, may be connected to the ports for recording or 
playback of the AV signals. A microprocessor 78 or other data packet processing 
unit may communicate with the data processor to manipulate the data packets 
included therewith, such as to facilitate communication with a remote network and 
the like. The microprocessor, 78 for example, may be associated with a cable 

25 modem or other bi-directional device such that it may be further configured to 
communication upstream signals through the data processor 58, which packetizes 
the signals for upstream network communication. 

While embodiments of the invention have been illustrated and 
described, it is not intended that these embodiments illustrate and describe all 
30 possible forms of the invention. Rather, the words used in the specification are 
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words of description rather than limitation, and it is understood that various changes 
may be made without departing from the spirit and scope of the invention. 
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